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Abstract 

In this paper we extend the classical notion of strong and weak backdoor sets for SAT and CSP 
by allowing that different instantiations of the backdoor variables result in instances that belong to 
different base classes; the union of the base classes forms a heterogeneous base class. Backdoor sets 
to heterogeneous base classes can be much smaller than backdoor sets to homogeneous ones, hence 
they are much more desirable but possibly harder to find. We draw a detailed complexity landscape 
for the problem of detecting strong and weak backdoor sets into heterogeneous base classes for SAT 
and CSP. 


1 Introduction 


We consider the propositional satisfiability problem (SAT) and the constraint satisfaction problem 
(CSP). Backdoors are small sets of variables of a SAT or CSP instance that represent “clever reasoning 
shortcuts” through the search space. Backdoor sets were originally introduced by Williams, Gomes, and 
Selman as a tool for the analysis of decision heuristics in propositional satisfiability. Since then, 

backdoor sets have been widely used in the areas of propositional satisfiability 

and also for material discovery , abductive reasoning [s^ , argumentation |l2|. quantified Boolean 
formulas [s^, and planning [ii l30l| . A backdoor set is defined with respect to some fixed base class for 
which the computational problem under consideration is polynomial-time tractable (alternatively, it can 
be defined with respect to a polynomial-time subsolver). The size of the backdoor set can be seen as a 
distance measure that indicates how far the instance is from the target class. One distinguishes between 
strong and weak backdoor sets; the latter applies only to satisfiable instances. Once a strong backdoor 
set of size k is identified, one can decide the satisfiability of the instance by deciding the satisfiability 
of at most “easy” instances that belong to the tractable base class, where d denotes the size of the 
domain for the variables; for SAT we have d = 2. Each of the easy instances is obtained by one of 
the d^ possible instantiations of the k variables in the backdoor set. Hence, the satisfiability check is 
fixed-parameter tractable llj for the combined parameter backdoor size and domain size {k -\- d). A 


similar approach works for weak backdoor sets, where the computation of a certificate for satisfiability 
(i.e., a satisfying assignment) is fixed-parameter tractable for the combined parameter {k d). 

The fixed-parameter tractability of using the backdoor set for deciding satisfiability or the compu¬ 
tation of a certificate for satisfiability triggers the question of whether finding a backdoor set of size 
at most k is also fixed-parameter tractable. In particular, for every base class C one can ask whether 
the detection of a strong or weak backdoor set into C of size at most k is fixed-parameter tractable 
for parameter k (possibly in combination with restrictions on the input or other parameters). A sys¬ 
tematic study of the parameterized complexity of backdoor set detection was initiated by Nishimura 
et al. [s^ for SAT, who showed that the detection of strong backdoor sets into the classes HORN 
and 2CNF is fixed-parameter tractable, but the detection of weak backdoor sets into any of these two 
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classes is W[2]-hard (and hence unlikely to be fixed-parameter tractable). Since then, the parameterized 
complexity of backdoor set detection has become an active research topic as outlined in a survey [l^ . 

In this work, we provide two significant extensions to the exciting research on fixed-parameter 
tractable backdoor set detection. First, we introduce heterogeneous base classes, which can be com¬ 
posed of several homogeneous base classes. We show that heterogeneous base classes are particularly 
well-suited for strong backdoor sets, since they allow that different instantiations of the backdoor vari¬ 
ables result in instances that belong to different base classes, which leads to arbitrary reductions in the 
size of a strong backdoor set even compared to the smallest strong backdoor set for each individual 
homogeneous base class. This is in contrast to weak backdoor sets, where the size of a weak backdoor 
set into a heterogeneous base class is equal to the size of the smallest backdoor set into any of the 
individual base classes. Here we show that also the complexity of weak backdoor set detection into 
heterogeneous base classes is tight to its complexity on the individual base classes. Second, we extend 
the scope of backdoor set detection from SAT to CSP, considering target classes that are defined by 
tractable constraint languages in terms of certain closure properties under polymorphisms. 

Heterogeneous Base Classes Consider the following SAT instance = {C, Di,..., Dn} where 
C = (a; V -lOi V • ■ • V -'On) and Di = {^x W biW Ci). It is easy to see that any strong backdoor set 
into HORN needs to contain at least one of the variables bi or Ci from each clause Di, hence such a 
backdoor set must be of size H(n); on the other hand, any strong backdoor set into 2CNF must contain 
at least (n — 2) variables from the clause C; hence such a backdoor must be of size at H(n) as well. 
However, Fn[x = false] is Horn, and Fn[x = true] is a 2CNF, hence the singleton {a:} constitutes a strong 
backdoor set into the “heterogeneous” base class HORNU2CNF. This example shows that by considering 
heterogeneous base classes we can access structural properties of instances that are not accessible by 
backdoor sets into homogeneous base classes. Identifying a base class with a class of instances that are 
solvable by a particular polynomial-time subsolver, one can consider a heterogeneous base class as a 
“portfolio subsolver,” where for each instance the best suitable subsolver from the portfolio is chosen. 

SAT Backdoor Sets A natural question at this point is whether the fixed-parameter tractability 
results for the detection of strong backdoor sets into individual base classes can be extended to more 
powerful heterogeneous base classes. In this work, we completely characterize the complexity landscape 
for heterogeneous base classes obtained arbitrary combinations of the well-known Schaefer classes [38(, 
in the following denoted by HORN (for Horn formulas), HORN” (for Anti-Horn formulas), 2CNF (for 
2-CNF formulas), 0-VAL (for zero valid formulas), and I-VAL (for one valid formulas). To state the 
classification, we briefly introduce some terminology. We say that a pair (C, C) of Schaefer classes is a 
bad pair if C S {HORN, 0-VAL} and C' G {HORN”, 1-VAL}. 

Let C be a class of CNF-formulas such that C = Usgs ^ for some subset S of the Schaefer classes. 
Our main result for SAT backdoor sets (Theorem^ is that Strong C-Backdoor Detection is fixed 
parameter tractable if and only if S does not contain a bad pair of Schaefer classes, otherwise it is 
W[2]-hard. On the other hand, detecting weak backdoors is always W[2]-hard for any choice of C, and 
we show this by building on the known hardness results for when C is a singleton set (Theorem [8|) . 

We also show that Strong C-Backdoor Detection as well as Weak C-Backdoor Detection 
are fixed-parameter tractable for the combined parameter k and the maximum length r of a clause of 
the input formula (Theorem [3 and [H]). These algorithms formalize the intuition that all the hardness 
results in the previous classification result exploit clauses of unbounded length. 

CSP Backdoor Sets The identification of tractable classes of CSP instances has been subject of 
extensive studies. A prominent line of research, initiated by Schaefer in his seminal paper on 
Boolean CSP, is to identify tractable classes by restricting the relations that may appear in constraints 
to a prescribed set, a constraint langu^e. Today, many constraint languages have been identified that 
give rise to tractable classes of CSPs [t [HI. 1^: typically such languages are defined in terms of certain 
closure properties, which ensure that the relations are closed under pointwise application of certain 
operations on the domain. For instance, consider a CSP instance whose relations are closed under a 
constant operation f{x) = d for some d G D (such a operation is a polymorphism of the relations in 
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the instance). Then note that every relation is either empty or forced to contain the tuple (d, d,..., d). 
Thus, given a particular instance, we may either declare it unsatisfiable (if it contains a constraint 
over the empty relation), or satisfy it trivially by setting every variable to d. Further examples of 
polymorphisms for which closure properties yield tractable CSP are min, max, majority, affine, and 
Mal’cev operations [H, Hi- 

We study the problem of finding strong and weak backdoor sets into tractable classes of CSP instances 
defined by certain polymorphisms. Our main result for CSP backdoors (Theorem [TUI) establishes fixed- 
parameter tractability for a wide range of such base classes. In particular, we show that the detection 
of strong and weak backdoor sets is fixed-parameter tractable for the combined parameter backdoor 
size, domain size, and the maximum arity of constraints. In fact, this result entails heterogeneous base 
classes, as different instantiations of the backdoor variables can lead to reduced instances that are closed 
under different polymorphisms (even polymorphisms of different type). We complement our main result 
with hardness results that show that we lose fixed-parameter tractability when we omit either domain 
size or the maximum arity of constraints from the parameter fTheorems lI2l and ll3l) . Hence, Theorem llOl 
is tight in a certain sense. 


Related Work Recently, two papers dealing with strong backdoor set detection for the constraint 
satisfaction problem have appeared 0,113, which nicely supplement (however are mostly orthogonal 
to) the results in this paper. Carbonnel et al. Q show W[2]-hardness for strong backdoor set detection 
parameterized by the size of the backdoor set even for CSP-instances with only one constraint (however 
with unbounded domain and unbounded arity). They also give a fixed-parameter algorithm for strong 
backdoor set detection parameterized by the size of the backdoor set and the maximum arity of any 
constraint, if the base class is “h-Helly” for any fixed integer h. However, as is also mentioned in Car¬ 
bonnel and Cooper Q, the “h-Helly” property is rather restrictive and orthogonal to our approach of 
considering the domain as an additional parameter. 

Ganian et al. 17[ show fixed-parameter tractability of strong backdoor detection parameterized by 
the size of the backdoor to a very general family of heterogeneous base classes, i.e., base classes defined as 
all CSP-instances obtained from the disjoint union of tractable, finite, and semi-conservative constraint 
languages. These base classes are orthogonal to the base classes considered in this paper. They are more 
general in the sense that they allow for the CSP-instance to consist of pairwise disjoint subinstances each 
belonging to a different tractable class, and they are more specific in the sense that they only consider 
finite and semi-conservative constraint languages, which also restricts the CSP-instances to bounded 
domain and bounded arity. 


2 Preliminaries 

SAT A literal is a propositional variable a: or a negated variable —'X. We also use the notation x = 
and -ix = x^. A clause is a finite set of literals that does not contain a complementary pair x and 
-<x. A propositional formula in conjunctive normal form, or CNF formula for short, is a set of clauses. 
For a clause C we write var(C) = {x : x € C or ^x £ C }, and for a CNF formula F we write 
var(F) = UceFvar(C'). 

For a set X of propositional variables we denote by X {0,1} the set of all mappings from X to 
{0,1}, the truth assignments on X. We denote by X the set of literals corresponding to the negated 
variables of X. For r : A —>■ {0,1} we let true(T) = { ■. x £ X} and false(r) = { : x £ X} 

be the sets of literals set by r to 1 and 0, respectively. Given a CNF formula F and a truth assignment 
T : A ^ {0,1} we define F[t] = { C \ false(r) : C £ F, CD true(r) = 0 }. If r : {x} —>■ {0,1} and 
e = t(x), we simply write F[x = e] instead of Fjr]. 

A CNF formula F is satisfiable if there is some t : var(F) —)• {0,1} with F’jr] = 0, otherwise F is 
unsatisfiable. 

CSP Let be a set and n and n' be natural numbers. An n-ary relation on I? is a subset of D”. 
For a tuple t £ H", we denote by t\i], the f-th entry of t, where 1 < i < n. For two tuples t £ D" and 
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t' G -D" , we denote by tot', the concatenation of t and t'. 

An instance of a constraint satisfaction problem (CSP) / is a triple (V, D, C), where P is a finite set 
of variables over a finite set (domain) D, and (7 is a set of constraints. A constraint c G C consists of 
a scope, denoted by V(c), which is an ordered list of a subset of V, and a relation, denoted by R(c), 
which is a |td(c)|-ary relation on D. To simplify notation, we sometimes treat ordered lists without 
repetitions, such as the scope of a constraint, like sets. For a variable v GV{c) and a tuple t G R{c), we 
denote by t[v], the f-th entry of t, where i is the position of u in F(c). For a CSP instance I = (F, D, C) 
we sometimes denote by V{I), D{I), C{I), and S{I), its set of variables F, its domain D, its set of 
constraints C, and the maximum arity of any constraint of I, respectively. 

Let F' C F and t : V ^ D. For a constraint c G C, we denote by c[t], the constraint whose scope 
is F(c) \ F' and whose relation contains all |F(c[T])|-ary tuples t such that there is a |F(c)|-ary tuple 
t' G R{c) with t[v\ = t'[v\ for every v G F(c[t]) and t'[v\ = t(v) for every v G V . We denote by /[r] the 
CSP instance with variables F \ F', domain D, and constraints C[t], where C[t\ contains a constraint 
c[t] for every c G C. 

A solution to a CSP instance / is a mapping t : V ^ D such that (r[r!i],...,'r[r)|v'(c)|]) G R{c) for 
every cGC with F(c) = {vi ,..., W|v(c)|). 

Backdoors Backdoors are defined relative to some fixed class C of instances of the problem under 
consideration (i.e., SAT or CSP). One usually assumes that the problem is tractable for instances from 
C, as well as that the recognition of C is tractable. 

In the context of SAT, we define a strong C-backdoor set of a CNF formula F to be a set B of 
variables such that F[t] G C for each t : B ^ {0,1}. A weak C-backdoor set of F is a set B of variables 
such that F[t] is satisfiable and F[t] G C holds for some t : B —>■ {0,1}. If we know a strong C-backdoor 
set of F, we can decide the satisfiability of F by checking the satisfiability of 2^ “easy” formulas F[t] 
that belong to C. Thus SAT decision is fixed-parameter tractable in the size k of the backdoor. Similarly, 
in the context of CSP, we define a strong C-backdoor set of a CSP instance I = (F, D, C) as a set B of 
variables such that /[r] G C for every t : B ^ D. We also call a strong C-backdoor a strong backdoor set 
into C. A weak C-backdoor set of / is a set B of variables such that I[t] is satisfiable and I[t] G C holds 
for some t : B ^ D. If we know a strong C-backdoor set of I of size k, we can reduce the satisfiability 
of I to the satisfiability of d^ CSP instances in C where d = \D\. Thus deciding the satisfiability of a 
CSP instance is fixed-parameter tractable in the combined parameter d -\- k. 

The challenging problem is—for SAT and for CSP—to find a strong, or weak C-backdoor set of size 
at most k if one exists. 

For each class C of SAT or CSP instances, we define the following problem. 

Strong C-Backdoor Detection 

Input: A SAT or CSP instance I and a nonnegative integer k. 

Question: Does I have a strong C-backdoor set of size at most kl 

The problem Weak C-Backdoor Detection is defined similarly. 


Parameterized Complexity We provide basic definitions of parameterized complexity; for an in- 
depth treatment we refer to the recent monograph 
instance / is associated with a nonnegative integer k, the parameter. 


ll| . A problem is parameterized if each problem 
A parameterized problem is 


fixed-parameter tractable (or FPT, for short) if there is an algorithm. A, a constant c, and a computable 
function /, such that A solves instances of input size n and parameter k in time f{k)n‘'. Fixed-parameter 
tractability extends the conventional notion of polynomial-time tractability, the latter being the special 
case where / is a polynomial. The so-called Weft-hierarchy W[l] C W[2] C ... contains classes of 
parameterized decision problems that are presumed to be larger than FPT. It is believed that problems 
that are hard for any of the classes in the Weft-hierarchy are not fixed-parameter tractable. The classes 
are closed under fpt-reductions that are fixed-parameter tractable many-one reductions, which map an 
instance x with parameter k of one problem to a decision-equivalent instance x' with parameter k' of 
another problem, where k' < f{k) for some computable function /. 

For instance, the following problem is well-known to be W[2]-complete 111. 
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Hitting Set Parameter: k 

Input: A set system Q over a set (universe) U, and a positive integer k 

Question: Is there a subset X C U, with |X| < k such that every set S G Q contains at least one 
element from XI 


3 Backdoor Detection for SAT 

Schaefer’s base classes [s^ give rise to classes of CNF formulas defined in terms of syntactical properties 
of clauses 0 A clause is 

• Horn if it contains at most one positive literal, 

• Anti-Horn if it contains at most one negative literal, 

• 2CNF if it contains at most two literals^ 

• 0-valid if it is empty or contains at least one negative literal, and 

• 1-valid if it is empty or contains at least one positive litera0. 

A CNF formula is Horn, Anti-Horn, etc. if it contains only Horn, Anti-Horn, etc. clauses. We denote 
the respective classes of CNF formulas by HORN, HORN”, 2CNF, 0-VAL, and 1-VAL. 

3.1 Strong Backdoor Sets 

In this section we study the parameterized complexity of Strong C-Backdoor Detection, where C is 
the union of any subset of the Schaefer classes. In the case that C consists only of a single Schaefer class 
this has been intensively studied before [l^,[^ and it is known that Strong C-Backdoor Detection 
is polynomial for 0-VAL and I-VAL, and FPT for the remaining Schaefer classes. 

The results in this subsection are as follows. First we show that Strong H-Backdoor Detection 
is fixed-parameter tractable parameterized by the size of the backdoor set for various combinations of the 
Schaefer classes, i.e., the case that H = 2CNFUC for any C G Schaefer is shown in Theoremj^l the cases 
that H. = HORN U 0-VAL and H. = HORN” U I-VAL are shown in Theorem [31 and finally Theorem 2] 
shows tractability for the cases that H. = 2CNFUHORNU0-VAL and H = 2CNFUHORN” U 1-VAL. In 
contrast. Theorem [3] captures a condition for hardness of the strong backdoor detection problem, which 
is complementary to all the cases in the first three results. Together, and combined with known results 
in the literature, these theorems give us a complete complexity classification, which is summarized in 
Theorem [31 

Our hardness results rely on the fact that the clauses have unbounded length. Theorem 0 shows 
that if we consider the maximum length of any clause as an additional parameter, then the problem of 
finding a strong heterogeneous backdoor becomes fixed-parameter tractable for any combination of the 
Schaefer classes. 

As mentioned above we employ branching algorithms for the detection of strong backdoor sets. In 
particular, all algorithms are based on a depth-bounded search tree approach and mainly differ in the 
set of employed branching rules. Informally, the algorithms construct a search tree, where each node is 
labeled with a potential backdoor set and the crucial difference between the algorithms lies in chosing 
the set of possible extensions of the potential backdoor set on which to branch on. In this way backdoor 
set detection can be reduced to the following problem, which given a potential backdoor set B' computes 
the set Q of potential extensions of B', which can be branched on in the next step. 

^AfBne Boolean formulas considered by Schaefer do not correspond naturally to a class of CNF formulas, hence we do 
not consider them here. However, we do consider the afHne case in the context of Boolean CSPs; cf. Theorem ll2l 
clause containing exactly two literals is also known as a Krom clause l26l . 

®0-valid and 1-valid is often defined more restrictively, not allowing empty clauses. 
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C-Strong Branch 

Input: A CNF formula F with variables V, a non-negative integer k, and a subset B’ C V with 

\B'\ < k. 

Question: Output: 

• Yes iff B' is a strong C-backdoor set of F, or 

• a family Q of subsets of V \B' such that: 

B1 for every Q G Q it holds that \B' LlQ\ < k, and 

B2 for every strong C-backdoor set B of F with B' C B and \B\ < k there is a Q € Q such 
that B' U Q C B. 

The next theorem shows that Strong C-Backdoor Detection can be reduced to C-Strong 
Branch via a branching algorithm that uses an algorithm A for C-Strong Branch as a subroutine to 
determine the set of braching choices. Importantly the running time of the resulting branching algorithm 
for Strong C-Backdoor Detection crucially depends on the “type” of braching rules returned by 
A, or in other words the branching factor. To be able to quantify the branching factor we introduce the 
function Tj^{k) defined recursively as 

TA{k) = max ( IQlT^lfc — min IQI) | and Ta{0) = 0 

SeO \ QeQ J 

where O is the set of all possible output sets Q of A on any input F, B' CV with \B'\ < k, and k. 

Theorem 1. Let C be a class of CNF formulas, F be a CNF formula, k a non-negative integer, and 
let A be an algorithm solving C-Strong Branch. Then Strong C-Backdoor Detection can be 
solved in time fc) • TA{k)), where fc) is the time required by A on any input F, B' CV 

with \B'\ < k, and k. 

Proof. We will employ a branching algorithm that employs the algorithm A solving C-Strong Branch 
as a subroutine. The algorithm uses a depth-bounded search tree approach to find a strong C-backdoor 
set of size at most k. Let F be any CNF-formula with variables V. 

We construct a search tree T, for which every node is labeled by a set B of at most k variables of V. 
Additionally, every leaf node has a second label, which is either Yes or No. T is defined inductively as 
follows. The root of T is labeled by the empty set. Furthermore, if t is a node of T, whose first label is 
B', then the children of t in T are obtained as follows. First we run the algorithm A on F, B', and k. 
If A returns Yes then t becomes a leaf node, whose second label is Yes. Otherwise let Q be the set of 
subsets of V \B' returned by „4. If Q = 0, then there is no strong C-backdoor set B of F with B' C B 
and \B\ < k and consequently t becomes a leaf node whose second label is No. Otherwise, i.e., if Q 7 ^ 0 
then t has a a child for every Q € Q, whose first label is B' U Q. This completes the definition of T. If 
T has a leaf node, whose second label is Yes, then the algorithm returns the first label of that leaf node 
as a strong C-backdoor set of F. Otherwise the algorithm returns No. This completes the description 
of the algorithm. 

We now show the correctness of the algorithm. First, suppose the search tree T built by the algorithm 
has a leaf node t whose second label is Yes. Here, the algorithm returns the first label, say B of t. By 
definition of T, we obtain that \B\ < k and B is a strong C-backdoor set of F. 

Now consider the case where the algorithm returns No. We need to show that F has no strong 
C-backdoor set of size at most k. Assume, for the sake of contradiction that such a set B exists. 

Observe first that for every leaf node t of T whose second label is No, it holds that B' ^ B. This is 
because the algorithm A on input F, B', and k returned the empty set for Q. Hence if t is a leaf node 
whose first label B' satisfies B' C B, then the second label of t must be Yes. 

It hence remains to show that T has a leaf node whose first label is a set B' with B' C B. This will 
complete the proof about the correctness of the algorithm. We will show a slightly stronger statement, 
namely, that for every natural number £, either T has a leaf whose first label is contained in B or T 
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has an inner node of distance exactly i from the root whose first label is contained in B. We show the 
latter by induction on £. 

The claim obviously holds for £ = 0. So assume that T contains a node t at distance ^ from the 
root of T whose first label, say B', is a subset of B. If t is a leaf node of T, then the claim is shown. 
Otherwise, the algorithm A determined that B' is not a strong C-backdoor set of F and returned a set 
Q of subsets oiV\B' satisfying B2. Hence there is a Q G Q with B' U Q C B and because t contains a 
child t' whose first label is B' U Q, we obtain that t' is a node at distance \ satisfying the induction 
invariant. This concludes the proof concerning the correctness of the algorithm. 

The running time of the algorithm is obtained as follows. Let T be a search tree obtained by the 
algorithm. Then the running time of the depth-bounded search tree algorithm is 0(|H(T)|) times the 
maximum time that is spend on any node of T. Since |H(T)| < T^(fc) and the time spend on any node 
is at most Ra{F, k), the stated running time follows. □ 

Our first tractability result is for Strong (2CNF UC)-Backdoor Detection, where C G Schaefer. 
Before formally stating the algorithm, we provide a brief intuition for the branching rules. Typically, 
we branch on an obstructing clause (or a pair of obstructing clauses), a situation that occurs in one of 
two flavors: 

(1) Either there is a clause C that is not in 2CNF U C, or, 

(2) there is a pair of clauses C and C, where C G 2CNF \ C and C gC \ 2CNF. 

Consider the case when C = 0-VAL. In scenario (1), the formula has a clause C of length at least 
three with only positive literals. Because C has only positive literals, it follows that no subclause of C 
is 0-valid. Hence every backdoor set has to hit all but at most two literals from C. It follows that if C 
has more than k + 2 literals, then there is no strong backdoor of size k to 0-VAL U 2CNF. Otherwise, 
the clause has at most k + 2 literals on which we can branch exhaustively. In scenario (2), we know 
that C has at most two literals and because C ^ 0-VAL we obtain that any strong backdoor set that 
does not contain a variable from C must reduce C to a clause with at most two literals. Hence, any 
strong backdoor set has to contain either a variable from C (but there are at most two of those) or must 
contain a variable from any subset of three variables in C. 

Consider the case when C = HORN. In scenario (1) above, the formula has a clause C of length at 
least three with at least two positive literals, say x and y. Let ^ be any other literal in C. It is clear 
that any strong backdoor set must contain one of x, y or and we branch accordingly. In scenario (2), 
we know that C has at most two literals and C contains at least two positive literals. Because every 
strong backdoor set that does not contain a variable from C must reduce C to a clause with at most 
two literals, we can branch on the variables in C and any subset of three variables in C'. 

The intuitive explanations for 1-VAL and HORN” are analogous. Note that the criteria described 
above need to be refined further at internal nodes of the branching process. We now turn to a formal 
exposition of the algorithm. 

Theorem 2. LetC G Schaefer. Then, Strong (2CNF UC)-Backdoor Detection is fixed-parameter 
tractable parameterized by the size of the backdoor set. 

Proof. Let Tl = 2CNFUC. We will provide an algorithm A solving 7^-Strong Branch in time 0{2^\F\) 
and satisfying T^ik) = 9^. The result then follows from Theorem[T] 

Let F be the given CNF formula with variables V, k the given integer, and B' GV with \B'\ < k. 
The algorithm A first checks whether F[t] G TL for every assignment t : B' ^ {0,1}. If that is the case 
then A returns Yes. Otherwise let t ■. B' ^ {0,1} be an assignment such that F[r] ^ TL, we consider 
two cases: 

1. if \B'\ = fc, then there is no strong 'H-backdoor set B oi F with B' C B and \B\ < k and the 
algorithm A correctly returns the set Q = 0, 

2. otherwise, i.e., if \B'\ < k, then we again distinguish two cases: 
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(a) there is a clause C e F[t] with C ^Ti., then we again distinguish two cases: 

i. if C G {HORN, HORN”}, then let O be a subset of C with |0| = 3 and O ^C. Then A 
returns the set Q = { {v} : v € var(O) }. 

To show the correctness of this step assume that there is a strong ?^-backdoor set B of F 
with B' C B and \B\ < k. It suffices to show that BCiO ^ 0. Suppose for a contradiction 
that this is not the case and let t* : B ^ {0,1} be an assignment compatible with r that 
does not satisfy any literal of C. Then, F[r*] contains a superset of O as a clause and 
hence F[t*] ^ 2CNF U C, contradicting our assumption that H is a strong ?I-backdoor 
set of F. 

ii. if C G (l-VAL, O-VAL), then we again distinguish two cases: 

A. \B'\ + \C\—2>k, then the algorithm A returns the set Q = 0. 

The correctness of this step follows from the fact that every strong ?^-backdoor set 
B of F with B' Q B has to contain all but at most two variables of C and hence B 
would have size larger than k. 

B. otherwise, i.e., if |i?| + jCI — 2 < k, then the algorithm A returns the set Q containing 
all sets var(C' \ O) for every two element subset O of C. 

To show the correctness of this step assume that there is a strong "H-backdoor set B 
of F with B' C B and \B\ < k. It remains to show that |var(C') \ B\ < 2. Assume 
not, and let t* : B ^ {0,1} be an assignment compatible with r that does not satisfy 
any literal of C. Then, C[t*] G and because |C'[r*]| > 2 and C[t*] ^ C, also 

F[t*] ^ 2CNF U C, contradicting our assumption that H is a strong "H-backdoor set 
of F. 

(b) otherwise, there is a clause C in F[t] with C G 2CNF \ C and a clause C in F[t] with 
C' G C \ 2CNF. Let O be the set of variables containing the (at most) two variables from C 
plus any three element subset of variables from C. Then, the algorithm A returns the set 
Q = { {u} : V G O}. 

To show the correctness of this step assume that there is a strong ?I-backdoor set B of F 
with B' C B and \B\ < k. It remains to show that B fl var(O) 0. Assume not, and let 
T* : B ^ {0,1} be an assignment compatible with r that does not satisfy any literal of C. 
Then, F[t*] contains the clause C together with a subclause C" of C containing at least 
three literals. Because C and C" ^ 2CNF, we obtain that F[t*] ^ 2CNF UC. 

This completes the description and the proof of correctness of the algorithm A. The running time of A 
is the number of assignments of the at most k variables in B' times the time required to test whether the 
reduced formula is in T-L, i.e., A runs in time 0(2^|F|). It remains to obtaine the stated bound of 9^ for 
the function Tj^{k). According to the three branching rules given in (2.a.i), (2.a.ii.B), and (2.b) we obtain 
that T^{k) can be bounded by the maximum of the following recurrence relations: (1) Tj((k) = 3T^(fc—1) 
(2.a.i), (2) T^{k) = r^T^{k — (r — 2)) for any r > 2 (2.a.ii.B), and (3) Tjs,{k) = 5T^{k — 1) (2.b). In case 
of (1) Ty^{k) is at most 3^, in case of (2) the maximum of 9^ is obtained for Tj^{k) by setting r = 3, and 
in case of (3) Tjx{k) is at most 5^. It follows that Tj^{k) <9^. □ 

We now turn to the case when the base classes are either HORN U O-VAL or HORN” U I-VAL. 

As before, in scenario (I), the length of an obstructing clause must be bounded in a Yes- instance. 
Indeed, consider a clause C that is not in HORN U O-VAL. Such a clause has only positive literals, and 
any strong backdoor set must contain all these literals but one. Therefore, if the clause has more than 
fc -I- I literals, we reject the instance, otherwise the clause length is bounded, allowing for a bounded 
branching strategy. 

In scenario (2), on the other hand, we have a pair of obstructing clauses, say C G HORN \ O-VAL, 
and C G O-VAL \ HORN. Now, note that C can have only one literal by definition. Let O be any subset 
of two positive literals from C. Note that any strong backdoor must intersect either O or the unique 
literal in C, which again leads us to a feasible branching step. We now turn to a formal description of 
the algorithm. 



Theorem 3. Strong (HORNUO-VAL)-Backdoor Detection and Strong (HORN U1-VAL)-Back- 
DOOR Detection are fixed-parameter tractable parameterized by the size of the backdoor set. 

Proof. We only give the proof for the class HORN U 0-VAL, as the proof for the class HORN” U 1-VAL 
is analogous. Let H — HORN U 0-VAL. We will provide an algorithm A solving "H-Strong Branch 

in time 0{2^\F\) and satisfying Tj^{k) = 3^. The result then follows from Theorem[TJ 

Let F be the given CNF formula with variables V, k the given integer, and B' CV with \B'\ < k. 
The algorithm A first checks whether F'[t] € H for every assignment t : B' ^ {0,1}. If that is the case 
then A returns Yes. Otherwise let t : B' ^ {0,1} be an assignment such that F[t] ^ T-t, we consider 
two cases: 

1. if \B'\ = k, then there is no strong H-backdoor set B of F with B' C B and \B\ < k and the 
algorithm A correctly returns the set Q = 0, 

2. otherwise, i.e., if \B'\ < fc, then we again distinguish two cases: 

(a) there is a clause C G F’[r] with C then we again distinguish two cases: 

i. |B| + |C|-1 > k, then the algorithm A returns the set Q = %. 

To show the correctness of this step it is sufficient to show that F does not have a strong 
"H-backdoor set B with B' C B and \B\ < k. Assume for a contradiction that this is not 
the case and let B be strong H-backdoor set of F with B' C B and \B\ < k. Because 

\B'\ + \C\-1 > k we obtain that B misses at least two literals of C. Let t' : B —>■ {0,1} 

be any assignment that is compatible with r and does not satisfy any literal from C. 
Then F[t'] contains a subclause of C that contains at least two literals. It follows that 
F'[t'] ^ B, contradicting our assumption that H is a strong "H-backdoor set for F. 

ii. otherwise, i.e., if iHl-t-ICI —I < k, then A returns the set Q = {var(C')\{u} : v G var(C') }. 

To show the correctness of this step assume that there is a strong "H-backdoor set B of 
F with B' <Z B and \B\ < k. It suffices to show that |var(C') \ i?| < 1. Assume not, 
and let r* : H —>■ {0,1} be an assignment compatible with r that does not satisfy any 
literal of C. Then, C[t*] G A[t*] and because |C'[t*]| > I and C[t*] ^ 0-VAL, also 
F'[t*] ^ HORN U 0-VAL, contradicting our assumption that H is a strong "H-backdoor 
set of F. 

(b) otherwise, there is a clause C in F[t] with C G HORN \ 0-VAL and a clause C in F'[r] with 
C G 0-VAL \ HORN. Let O be the set of variables containing the one variable from C plus 
any two variables corresponding to two positive literals from C. Then A returns the set 
Q = { {u} :v € O}. 

To show the correctness of this step assume that there is a strong H-backdoor set B of F 
with B' C B and \B\ < k. It sufficies to show that B Ci O ^ 0. Assume not, and let 
T* : i? —> {0,1} be an assignment compatible with r that does not satisfy any literal of C. 
Then, F’[t*] contains the clause C together with a subclause C" of C containing at least two 
positive literals. Because C ^ 0-VAL and C ^ HORN, we obtain that F’[t*] ^ B. 

This completes the description and the proof of correctness of the algorithm A. The running time of A 
is the number of assignments of the at most k variables in B' times the time required to test whether the 
reduced formula is in B, i.e., A runs in time 0{2^\F\). It remains to obtaine the stated bound of 3^ for 
the function T^ik). According to the two branching rules given in (2.a.ii) and (2.b) we obtain that Tj({k) 
can be bounded by the maximum of the following recurrence relations: (1) Tj^{k) = rT^{k — (r — 1)) 
for any r >2 (2.a.ii) and (2) Tji,{k) = 3Tx(/c — I) (2.b). In case of (I) the maximum of 2^ is obtained 
for T^{k) by setting r = 2, and in case of (2) Tji,{k) is at most 3^. It follows that T^(k) < 3^. □ 

In our next result, we consider heterogeneous base classes comprised of three Schaefer’s classes, 
namely 2CNF U HORN U 0-VAL and 2CNF U HORN” U 1-VAL. We refer the reader to Figure [T] for an 
overview of the branching strategies employed here. 
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Theorem 4. Strong (2CNF U HORN U 0-VAL) -Backdoor Detection and Strong (2CNF U 
HORN” U 1-VAL) -Backdoor Detection are fixed-parameter tractable parameterized by the size of 
the backdoor set. 

Proof. We only give the proof for the class 2CNF U HORN U 0-VAL, since the proof for the class 
2CNFUHORN” U 1-VAL is analogous. Let T-L = 2CNFU HORN U 0-VAL. We will provide an algorithm 
A solving "H-Strong Branch in time 0{2^\F\) and satisfying Ty\^{k) = 9^. The result then follows 
from Theorem [T] 

Let F be the given CNF formula with variables V, k the given integer, and B' QV with |i?'| < k. 
The algorithm A first checks whether F'[t] € H for every assignment t : B' ^ {0,1}. If that is the case 
then A returns Yes. Otherwise let t : B' ^ {0,1} be an assignment such that F[t] ^ PL, we consider 
two cases: 

1. if |i?'| = k, then there is no strong ?^-backdoor set B oi F with B' F B and \B\ < k and the 
algorithm A correctly returns the set Q = 0, 

2. otherwise, i.e., if \B'\ < k, then we again distinguish two cases: 

(a) there is a clause C € F'[t] with C ^PL, then we again distinguish two cases: 

i. \B'\ -I- ICI — 2 > fc, then the algorithm A returns the set Q = 0. 

To show the correctness of this step it is sufficient to show that F does not have a strong 
"H-backdoor set B with B' C B and \B\ < k. Assume for a contradiction that this is not 
the case and let B be strong H-backdoor set of F with B' C B and \B\ < k. Because 
\B'\ + \C\ — 2 > k, we obtain that B misses at least three literals from C. Hence, there is 
an assignment t' : B ^ {0,1} that is compatible with r such that F'[t'] contains a clause 
C' Q C with \C'\ > 2. Clearly, C' ^ 2CNFU0-VAL and because C contained only positive 
literals (because C ^ 0-VAL)) also C ^ HORN. Hence, C ^ 2CNF U HORN U 0-VAL 
contradicting our assumption that H is a strong H-backdoor set for F. 

ii. otherwise, i.e., if \B\ -|- jCj — 2 < A:, then the algorithm A returns the set Q = { var(C \ 
O) : O C var(C) A |0| = 2}. 

To show the correctness of this step assume that there is a strong H-backdoor set B of 
F with B’ C B and \B\ < k. It suffices to show that |var(C) \ H| < 2. Assume not, and 
let T* : B ^ {0,1} be an assignment compatible with r that does not satisfy any literal 
of C. Then, C[t*] G H[r*] and because |C'[r*]| > 2 and C[t*] contains only positive 
literals, also H[r*] ^ 2CNF U HORN U 0-VAL, contradicting our assumption that H is a 
strong H-backdoor set for F. 

(b) there is a clause C G F[t] with C G 2CNF and C ^ 0-VAL, we again distinguish two cases: 

i. there is a clause C G H[t] with C ^ 2CNF U HORN (note that C G 0-VAL). Let O be 
a set of variables containing all variables of C (at most two) plus three variables of C 
of which two correspond to positive literals in C. Then the algorithm A returns the set 
Q = { {u} : u G O }. 

To show the correctness of this step assume that there is a strong H-backdoor set B of 
F with B' C B and \B\ < k. It sufficies to show that H O O 0. Assume not and 
let T* : B ^ {0,1} be an assignment compatible with r, which does not satisfy any 
literal of C". Then, H[r*] contains C as well as a subclause C" of C of length at least 
three, which contains at least two positive literals. Then, C" ^ HORN U KROM and 
because C ^ 0-VAL, we obtain that H[r*] ^ 2CNF U HORN U 0-VAL, contradicting our 
assumption that H is a strong H-backdoor set for F. 

ii. otherwise, there are clauses C' 2 Cnf, Ohorn & F[t] with C' 2 Cnf G 2CNF \ HORN and 

Ohorn G horn \ 2CNF. Let O be a set of variables containing all variables of C 2 CNF 
(at most two) and three variables of Chorn- Then the algorithm A returns the set 
2 = G O}. 


10 


There is a clause C & F[t] 
C i 2CNF U HORN U 0-VAL 


\B\ + \C\-2>k 
Say No. 


\B\ < k 


/ There is a clause C G A[t] 

K - 

\ C € 2CNF and C ^ HORN 


There is a clause C & F[t] 
C G 2CNF and C ^ 0-VAL 


\B\ + \C\-2<k 
Branch. 

3C', C i 2CNF U HORN 
X ^ Branch. 


dCi, C 2 with 

'hCi G 2CNF \ HORN and C 2 G HORN \ 2CNF. 
Branch. 


3C G 2CNF and C i HORN. 
Implies C ^ 0-VAL, subsumed above. 


Figure 1: The branching cases for Theorem 0] 


To show the correctness of this step assume that there is a strong H-backdoor set B of 
F with B' Q B and \B\ < k. It suflicies to show that B DO 7 ^ 0. Assume not and let 
T* : B ^ {0,1} be an assignment compatible with r, which does not satisfy any literal 
of Chorn- Then, F[r*] contains C 2 CNF as well as a subclause C' of Chorn of length at 
least three. Because C2CNF G 2CNF \ HORN it follows that C'2Cnf ^ 0-VAL. Because 
also C ^ 2CNF, we obtain that ^ 2CNF U HORN U 0-VAL, contradicting our 

assumption that H is a strong "H-backdoor set for F. 

(c) there is a clause C G F[r] with C G 2CNF and C ^ HORN. In this case also C ^ 0-VAL and 
hence the case is covered by case ( 2 .b). 

(d) The cases (2.a), (2.b), and (2.c) completely cover all possible cases, because if none of these 
cases apply then F[t] G 0-VAL, which contradicts the choice of r. To see this assume there 
is a clause C G F[t] with C ^ 0-VAL. Because of the cases (2.b) and (2.c), we obtain 
that C ^ 2CNF. Furthermore, because of case (2.a), we obtain that C G HORN. Hence, 
C G HORN \ (0-VAL U 2CNF) but such a clause cannot exist. 

This completes the description and the proof of correctness of the algorithm A. The running time of A 
is the number of assignments of the at most k variables in B' times the time required to test whether 
the reduced formula is in H, i.e., A runs in time 0(2^ |F|). It remains to obtain the stated bound of 9^ 
for the function Tj^{k). 

According to the three branching rules given in (2.a.ii), (2.b.i), and (2.b.ii) we obtain that T^{k) 
can be bounded by the maximum of the following recurrence relations: ( 1 ) Tj\^{k) = ATj\{k — (r — 2 )) 
for any r > 2 (2.a.ii), (2) Tj^{k) = bTjs,{k — 1) (2.b.i), and (3) Tj^{k) = bTjs,{k — 1) (2.b.ii). In case of 
(2) and (3) T^ik) is at most 5^ and in case of (1) the maximum of 9^ is obtained for T^ik) by setting 
r = 3. It follows that Ty{{k) < 9^. 

□ 

We say that a pair (C,C') of Schaefer classes is a bad pair if C G {HORN, 0-VAL} and C' G 
{HORN”, l-VALj. Our next result establishes hardness for the case when the base class contains a 
bad pair. 
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Theorem 5. For every S C Schaefer that contains at least one bad pair of Schaefer classes, it holds that 
Strong C-Backdoor Detection is W[2]-hard parameterized by the size of the backdoor set, where 
C = Uses«- 

Proof. We give a parameterized reduction from the W[2]-complete Hitting Set problem. Given an 
instance {Q,U,k) for Hitting Set, construct a formula F as follows. The variables of F are Q U 
{ dg,...,■ Q € Q}. For each set Q € Q, there is one clause cq = Q U { dg, ...,}■ There 
is also one clause cjj = {-lU : u € U}. This completes the description of the reduction. 

We claim that Q has a hitting set of size at most k if and only if the formula F has a strong C- 
backdoor set of size at most k. Suppose X C U, |X| < fc, is a hitting set. To show that X is also 
a strong C-backdoor set, consider any assignment r : X — 5> {0,1}. If t{x) — 0 for some x € X, then 

T satisfies the clause cu- Thus, F’[t] € C since each clause in F[t] contains no negative literal and at 

least one positive literal and is hence in 1-VAL fl HORN”. If t{x) = I for all x G A, then all clauses 
cq,Q G Q, are satisfied by t since A is a hitting set. The only remaining clause is HORN and 0-VAL 
since it has no positive literal and at least one negative literal. 

For the other direction, suppose that A is a strong C-backdoor set of size at most k. Obtain X' from 
X by replacing each dg G X for any i with l<f<|C/\Q|by some variable from Q. 

The set X' is also a strong C-backdoor set of size at most k. Therefore, the assignment r : A' —>• {0,1} 
with t(x) = 1 for all x G A' must satisfy all clauses cq,Q G Q. Otherwise, F[t] contains both 
a long clause containing only positive and a long clause containing only negative literals and hence 

i U^Gss- Thus, A' is a hitting set for Q of size at most k. □ 

We now summarize our results in the following theorem. 


Theorem 6. Let S C Schaefer and C = Llsgs®- Then, Strong C-Backdoor Detection is fixed 
parameter tractable parameterized by the size of the backdoor set if S does not contain a bad pair of 
Schaefer classes, otherwise it is W[2]-dard. 


Proof. If S contains a bad pair then the result follows from Theorem [5] So assume that S does not 
contain a bad pair, we distinguish the following cases: 






if I S'I = 1, then the theorem follows from known results, see, for example, 18|, 


if |S| = 2, then either 2CNF G S in which case the result follows from Theorem [51 or S = 
{HORN, 0-VAL} or S = (HORN”, I-VAL} in which case the result follows from Theorem |3l 


• if |S| = 3, then either 2CNF G S and hence S = {2CNF, HORN, O-VAL) or S = {2CNF, HORN”, 1-VAL} 
in which case the result follows from Theorem 01 or 2CNF ^ S in which case S contains at least 
one bad pair, and the result follows from Theorem [5l 


• if |S| > 3, then S contains at least one bad pair, and the result follows from Theorem[5l 


□ 


It is crucial for these hardness proofs that clauses have unbounded length. Indeed, if clause-lengths 
are bounded or if we add the maximum clause length to the parameter, then strong backdoor detection 
becomes FPT for any combination of Schaefer classes. 

Theorem 7. Let C be a class of CNF-formulas such that C = UsgS ^ some S C Schaefer. Then, 
Strong C-Backdoor Detection is fixed-parameter tractable for the combined parameter k and the 
maximum length r of a clause of the input formula. 

Proof. We will provide an algorithm A solving C-Strong Branch in time 0(2^|F|) and satisfying 
Tj({k) = (|S'|r))^^^. The result then follows from Theorem[T] 

Let F be the given CNF formula with variables V , k the given integer, and B' QV with |i?'| < k. 
The algorithm A first checks whether F[t] G C for every assignment t : B' ^ {0,1}. If that is the case 
then A returns Yes. Otherwise let t : B' ^ (0,1} be an assignment such that F[t] ^ C, we consider 
two cases: 
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1. if jS'l = fc, then there is no strong C-backdoor set B oi F with B' C B and \B\ < k and the 
algorithm A correctly returns the set Q = 0. 

2. otherwise, i.e., if \B'\ < k, then let Cg be an arbitrary clause in F[t] with Cg ^ s for every s € S. 
The algorithm A then returns the set Q = Usesi ^ ^ var(Cs) }. 

To show the correctness of this step assume that there is a strong C-backdoor set B of F with 
B' C B and \B\ < k. It sufhcies to show that B fl ^ t* : B ^ {O;!} be 

any assignment of the variables in B that agrees with r on the variables in B' and let s G S' 
be such that F[t*] G s. Because i? is a strong C-backdoor set, s clearly exists. We claim that 
var(C') n B 0 for every clause C G F[t] with C ^ S (and hence in particular for the clause 
Cg chosen by ^). Indeed, suppose not. Then C G a contradiction to our assumption that 

G s. 

This completes the description and the proof of correctness of the algorithm A. The running time of A 
is the number of assignments of the at most k variables in B' times the time required to test whether 
the reduced formula is in C, i.e., A runs in time 0(2^|F|). It remains to obtain the stated bound of 
for the function T^(fc), which follows because \Q\ < |S'|r and \Q\ = 1 for every Q € Q. □ 

3.2 Weak Backdoor Sets 

It turns out that the complexity of finding weak backdoors into heterogeneous classes is tightly tied to 
the complexity of finding weak backdoors into the individual base classes. Let us briefly consider the 
intuition for this scenario before stating the result. Suppose we are given algorithms for finding weak 
backdoors into base classes Ci and C 2 . Observe that a CNF-formula has a weak backdoor of size at 
most k into (Ci U C 2 ) if, and only if, it has a weak backdoor of size at most k into at least one of Ci or 
C 2 . Thus, if we have a positive algorithmic result for the weak backdoor question with respect to the 
individual base classes, then these algorithms can be easily combined to obtain an algorithm for finding 
weak backdoors into the corresponding heterogeneous base class. On the other hand, a hardness result 
for even one of the involved classes can usually be leveraged to obtain a hardness result for the entire 
heterogeneous problem, by adding gadgets that isolate the issue to only one of the base classes, typically 
the one that was hard to begin with. In particular, we obtain the following result. 

Theorem 8. Let S C Schaefer and C = Usgs '®- Then, Weak C-Backdoor Detection is W[2]-hard 
parameterized by the size of the backdoor set. 

Before we proceed further, we introduce the notion of an obstruction which we will use in the proof. 
Specifically, for S, S' G Schaefer, we say that a clause C is an S'-obstruction to S" if C G S' and C ^ S'. 
For any pair of Schaefer classes S and S', it is easy to construct examples of S-obstructions to S'. In 
particular, we have: 

1. (-la;, -ly, -^z) is a HORN-obstruction to 2CNF, and also a 0-VAL-obstruction to 2CNF. 

2. {x,y,z) is an HORN“-obstruction to 2CNF, and also a 1-VAL-obstruction to 2CNF. 

3. {x, y, -iz) is a 0-VAL-obstruction to HORN. 

4. {x,y) is an X-obstruction to HORN for all X G {2CNF, HORN”, 1-VAL}. 

5. (-ix, -ly, z) is a 1-VAL-obstruction to HORN”. 

6 . (-la;, -ly) is an A-obstruction to HORN” for all X G {2CNF, HORN, 0-VAL}. 

7. (a:) is an X-obstruction to 1-VAL for all X G {2CNF, 1-VAL, HORN, HORN”}. 

8 . (-.a;) is an X-obstruction to 1-VAL for all X G {2CNF, 1-VAL, HORN, HORN”}. 

We are now ready to describe our reduction towards Theorem [S] 
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Proof. If I S'! = 1, then the result follows directly from [l^ Proposition 1]. Otherwise, let S £ C. We 
reduce an instance (F, k) of Weak S'-Backdoor Detection to Weak C-Backdoor Detection. 
To this end, for all S' £ C such that S ^ S', introduce (fc + 1) variable-disjoint copies of a S'-obstruction 
to S' using fresh variables for each obstruction, and in particular, disjoint from var(F). We use F' to 
denote the formula obtained from F after adding these obstructions to F, and the reduced instance is 
given by (F', k). 

In the forward direction, if X is a weak backdoor to S for F, and t ■. X ^ {0,1} is such that 
F[t] G S, then observe that F'[t] £ S, since all the added obstructions (which are unaffected by r) were 
in S by definition. It is also easy to see that any satisfying assignment for var(F) \ X can be easily 
extended to an assignment involving the new variables that satisfy all the obstructions (recall that the 
obstructions were variable disjoint, and therefore can be easily satisfied independently). 

On the other hand, let X be a weak backdoor to C for F'. Since X has at most k variables, for 
any r : X —>■ {0,1}, F'[t] contains at least one F'-obstructing clause for each S' £ C such that S' ^ S. 
Therefore, F'[t] must necessarily belong to 5, and therefore X n var(F) is easily checked to be a weak 
backdoor to S for F, as desired. □ 

On the other hand, as with the strong backdoors, we do obtain tractability when we have bounded 
clause lengths. We establish this formally below. 

Theorem 9. Let C be a class of CNF-formulas such that C = Uses ^ some S C Schaefer. Then, 
Weak C-Backdoor Detection is fixed-parameter tractable for the combined parameter k and the 
maximum length r of a clause of the input formula. 

Proof. The algorithm uses a depth-bounded search tree approach to find a weak C-backdoor set of size 
at most k. Let F be any CNF-formula with variables V and let r be the maximum length of any clause 
of F. 

We construct a search tree T, for which every node is labeled by a set B of at most k variables of V. 
Additionally, every leaf node has a second label, which is either Yes or No. T is defined inductively as 
follows. The root of T is labeled by the empty set. Furthermore, if t is a node of T, whose first label is 
B, the children of t in T are obtained as follows. 

If there is an assignment t : B ^ {0,1}, such that F[t] £ C and F[t] is satisfiable, then F is a weak 
C-backdoor set of size at most k, and hence t becomes a leaf node, whose second label is Yes. Otherwise, 
i.e., for every assignment r : B —>■ {0,1} either F[t] ^ C or F[r] £ C but F[t] is not satisfiable, we 
consider two cases: (1) \B\ = k, then t becomes a leaf node, whose second label is No, and (2) \B\ < k, 
then for every assignment r : B —>■ {0,1} such that F[t] ^ C, for every s £ S, and every variable v that 
occurs in some clause C ^ s, t has a child whose first label is F U {i;}. 

If T has a leaf node, whose second label is Yes, then the algorithm returns the first label of that 
leaf node. Otherwise the algorithm returns No. This completes the description of the algorithm. 

We now show the correctness of the algorithm. First, suppose the search tree T built by the algorithm 
has a leaf node t whose second label is Yes. Here, the algorithm returns the first label, say F of t. By 
definition of T, we obtain that \B\ <k and the set F is a weak C-backdoor set of F, as required. 

Now consider the case where the algorithm returns No. We need to show that there is no weak 
C-backdoor set of size at most k for F. Assume, for the sake of contradiction that such a set F exists. 

Observe that if T has a leaf node t whose first label is a set B' with F' C F, then the second label 
of t must be Yes. This is because, either \B'\ < fc in which case the second label of t must be Yes, or 
\B'\ = k in which case B' = B and by the definition of F it follows that the second label of t must be 
Yes. 

It hence remains to show that T has a leaf node whose first label is a set B' with F' C F. This will 
complete the proof about the correctness of the algorithm. We will show a slightly stronger statement, 
namely, that for every natural number i, either T has a leaf whose first label is contained in F or T 
has an inner node of distance exactly £ from the root whose first label is contained in F. We show the 
latter by induction on t. 

The claim obviously holds for .^ = 0. So assume that T contains a node t at distance £ from the 
root of T whose first label, say F', is a subset of F. If t is a leaf node of T, then the claim is shown. 
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Because B is a weak C-backdoor set for F, there is an assignment t* : B ^ {0,1} such that F[r*] G C 
and is satisfiable. In particular, there is some s G S such that ^[t*] G s. Let r : B' -G {0,1} be 
the assignment that agrees with t* on the variables of B'. Because t is not a leaf node, there is a clause 
C G i^[T] with C ^ s such that t has a child t' for every v G var(C). We claim that var(C') fl B 7 ^ 0 and 
hence t has a child, whose first label is a subset of B, as required. Indeed, suppose not. Then C G 
a contradiction to our assumption that G s. This concludes our proof concerning the correctness 

of the algorithm. 

The running time of the algorithm is obtained as follows. Let T be a search tree obtained by the 
algorithm. Then the running time of the depth-bounded search tree algorithm is 0(|F(T)|) times the 
maximum time that is spent on any node of T. Since the number of children of any node of T is bounded 
by 2^|S'|r (recall that r denotes the maximum length of any clause of F) and the longest path from the 
root of T to some leaf of T is bounded by fc -I- I, we obtain that |t^(T)| < 0((2^|S'|r)^~''^). Furthermore, 
the time required for any node t of T is at most 0{2^\F\). Putting everything together, we obtain 
0((2^|5'|r)^+^2*|F'|), as the total running time of the algorithm. This shows that Weak C-Backdoor 
Detection is fixed-parameter tractable parameterized by k, r. □ 

We close this section by noting that backdoor sets with empty clause detection, as proposed by 
0 can be considered as backdoor sets into the heterogeneous base class obtained by the union of a 
homogeneous base class C and the class of all formulas that contain the empty clause. The detection 
of strong backdoor sets with empty clause detection is not fixed-parameter tractable for many natural 
base classes, including Horn and 2CNF [s^. 

4 Base Classes via Closure Properties 

In this section we provide a very general framework that will give rise to a wide range of heterogeneous 
base classes for CSP. 

Given an r-ary relation R over some domain D and an n-ary operation (j) : —>■ D, 

we say that R is closed under (p, if for all collections of n tuples ti,...,tn from R, the tuple 
..., ..., ..., t„[r])) belongs to R. The operation p is also said to be a polymor¬ 

phism of R. We denote by Pol(i?) the set of all operations p such that R is closed under cp. 

Let I = iy, D, C) be a CSP instance and cG C. We write Pol(c) for the set Pol(i?(c)) and we write 
Pol(/) for the set HcgC Pol(c)- We say that I is closed under an operation p, or (/) is a polymorphism of 
/, if (/)G Pol(/). 

We say an operation p is tractable if every CSP instance closed under p can be solved in polynomial 
time. 

Let P(0) be a predicate for the operation p. We call IP(^) a tractable polymorphism property if the 
following conditions hold. 

• There is a constant cp such that for all finite domains D, all operations p over D with property P 
are of arity at most cp. 

• Given an operation p and a domain D, one can check in polynomial time whether P((^) holds on 
all of the at most tuples over D, 

• Every operation with property P is tractable. 

Every tractable polymorphism property P gives rise to a natural base class Cp consisting of all 
CSP-instances I such that Pol(/) contains some polymorphism p with P{p). 

In the following we will provide several illustrative examples for classes of CSP instances that can be 
defined via tractable polymorphism predicates. The chosen examples are mostly based on the Schaefer 
classes or important generalizations thereof. For the definition of the examples we need to define the 
following types of operations. 

• An operation p : D" -G D is idempotent if for every d G D it holds that p{d,... ,d) = d; 


15 


• An operation d> : D ^ D is constant if there is a d G 13 such that for every d' G D, it holds that 

m = d; 

• An operation (j) : > Z3 is a min/max operation if there is an ordering of the elements of D 

such that for every d,d' G D, it holds that (j){d,d') = (j){d',d) = min{d, d'} or (j){d,d') = (j){d',d) = 
max{d, d'}, respectively; 

• An operation (h : ^ 13 is a majority operation if for every d.d' G D it holds that (bid.d.d'j = 

^(d,d',d) = (^(d',d,d) = d; 

• An operation (f) : ^ 13 is an minority operation if for every d,d' G D it holds that (j){d, d,d') = 

4>{d, d', d) = (/'(d', d, d) = d'; 


• An operation </> : ^ Z3 is a Mal’cev operation if for every d,d' G D it holds that ^(d, d, d') = 

4>{d', d, d) = d'. 


It is known that every constant, min/max, majority, minority, and Mal’cev operation is tractable [^, 
[ 2 ^. We denote by VAL, MINMAX, MAJ, MIN, and MAL the class of CSP instances / for which Pol(/) 
contains a constant, a min/max, a majority, a minority, or a Mal’cev polymorphism, respectively. 

Thus VAL, MINMAX, MAJ, MIN, and MAL are the classes Cp for P G {constant, min/max, majority, 
minority, Mal’cev}, respectively. 

In terms of the above definitions we can state the results of 241 and Q as follows. 


Proposition 1. Constant, min/max, majority, minority, and Mal’cev are tractable polymorphism prop¬ 
erties. 


We want to note here that Proposition[T]also applies for other much more general types of operations 
such as semilattice operations (sometimes called ACI operation) (a generalization of min/max), 
k-axy near unanimity operations [isL (a generalization of majority), k-ary edge operations (a 
generalization of Mal’cev), and the two operations of arities three and four [ 2 ^ that capture the bounded 
width property Q (a generalization of semilattice and near unanimity operations). 

In the next sections we will study the problems Strong Cp-Backdoor Detection for tractable 
polymorphism properties P. 


5 Tractability of Backdoor Detection for CSP 

In this section we will show that both Strong Cp-Backdoor Detection and Weak Cp-Backdoor 
Detection parameterized by the size of the backdoor set, the size of the domain, and the maximum 
arity of the CSP instance are fixed-parameter tractable for any tractable polymorphism property P. We 
start by giving the tractability results for strong backdoor sets. 

Theorem 10. Let P be a tractable polymorphism property. Then Strong Cp-Backdoor Detection 
is fixed-parameter tractable for the combined parameter size of the backdoor set, size of the domain, and 
the maximum arity of the given CSP instance. 

Proof. Let P be a tractable polymorphism property, and let {I, k) with / = {V, D, C) be an instance 
of Strong Cp-Backdoor Detection. Let P be the set of all operations on D that have property 
P. Then, P can be constructed in fpt-time with respect to the size of the domain , because there are 
at most |13{^l''’’ cp-ary operations on D and for each of them we can test in polynomial time, 
whether it satisfies property P. The algorithm uses a depth-bounded search tree approach to find a 
strong Cp-backdoor set of size at most k. 

We construct a search tree T, for which every node is labeled by a set B of at most k variables of V. 
Additionally, every leaf node has a second label, which is either Yes or No. T is defined inductively as 
follows. The root of T is labeled by the empty set. Furthermore, if t is a node of T, whose first label is 
B, then the children of f in T are obtained as follows. If for every assignment t : B ^ D there is an 
operation / G P such that I[t] is closed under </>, then B is a strong P-backdoor set of size at most k, 
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and hence t becomes a leaf node, whose second label is Yes. Otherwise, i.e., if there is an assignment 
T : B ^ D such that I[t] is not closed under any operation (j) £ P, we consider two cases: (1) \B\ = fc, 
then t becomes a leaf node, whose second label is No, and (2) \B\ < k, then for every operation cj) £ P 
and every variable v in the scope of some constraint c £ C[t] that is not closed under (j), t has a child 
whose first label is B U {f}. 

If T has a leaf node, whose second label is Yes, then the algorithm returns the first label of that 
leaf node. Otherwise the algorithm returns No. This completes the description of the algorithm. 

We now show the correctness of the algorithm. First, suppose the search tree T built by the algorithm 
has a leaf node t whose second label is Yes. Here, the algorithm returns the first label, say B of t. By 
definition, we obtain that |i?| < fc and for every assignment t : B ^ D, it holds that I[t] is closed under 
some operation in P, as required. 

Now consider the case where the algorithm returns No. We need to show that there is no set B of at 
most k variables of / such that Po1(J[t]) fl P ^ 0 for every assignment r of the variables of B. Assume, 
for the sake of contradiction that such a set B exists. 

Observe that if T has a leaf node t whose first label is a set B' with B' £ B, then the second label 
of t must be Yes. This is because, either \B'\ < fc in which case the second label of t must be Yes, or 
\B'\ = k in which case B' = B and by the definition of B it follows that the second label of t must be 
Yes. 

It hence remains to show that T has a leaf node whose first label is a set B' with B' C B. This will 
complete the proof about the correctness of the algorithm. We will show a slightly stronger statement, 
namely, that for every natural number £, either T has a leaf whose first label is contained in P or T 
has an inner node of distance exactly £ from the root whose first label is contained in B. We show the 
latter by induction on £. 

The claim obviously holds for £ = 0. So assume that T contains a node t at distance £ from the 
root of T whose first label, say P', is a subset of P. If t is a leaf node of T, then the claim is shown. 
Otherwise, there is an assignment r : P' —> P such that I[t] is not closed under any operation from P. 
Let T* : P —>■ P be any assignment of the variables in P that agrees with r on the variables in B' and 
let (j) £ P be such that I[t*] is closed under (p. Because P is a strong P-backdoor set, the polymorphism 
(p clearly exists. By definition of the search tree T, t has a child t' for every variable v in the scope of 
some constraint c G C[t\ that is not closed under (p. We claim that Y(c)OP 0 and hence t has a child, 
whose first label is a subset of P, as required. Indeed, suppose not. Then c £ C[t*] a contradiction to 
our assumption that I[t*] is closed under p. This concludes our proof concerning the correctness of the 
algorithm. 

The running time of the algorithm is obtained as follows. Let T be a search tree obtained by 
the algorithm. Then the running time of the depth-bounded search tree algorithm is 0{\V{T)\) 
times the maximum time that is spent on any node of T. Since the number of children of any 
node of T is bounded by |P|(5(/) (recall that 6(1) denotes the maximum arity of any constraint of 
I) and the longest path from the root of T to some leaf of T is bounded by fc -|- 1, we obtain 
that |Y(r)| < 0((|P|d(/))*''’^). Furthermore, the time required for any node t of T is at most 
0(|P|^comp_rest(/, T)|C'(J[r]) I|P|check_poly(c, (/>)), where comp_rest(/, r) is the time required to com¬ 
pute I[t] for some assignment r of at most k variables and check_poly(c, (/>) is the time required to 
check whether a constraint c of I[t] preserves the operation p £ P. Observe that comp_rest(/, r) 
and |C'(J[t])| are polynomial in the input size. The same holds for check_poly(c, <()), because p is 
a cp-ary operation. Now, the total running time required by the algorithm is the time required to 
compute the set P plus the time required to compute T. Putting everything together, we obtain 
0((|P|5(/))'=+i|P|'=|P|n<=’(i)) = 0((|P|5(/)|P|)'=+2n<='(i)) = 0((|P|'^'°''5(/)|P|)'=+2nO(i)), r as the total 
running time of the algorithm, where n denotes the input size of the CSP instance. This shows that 
Strong Cp-Backdoor Detection is fixed-parameter tractable parameterized by fc, 5{I), and \D\. □ 

Because of Proposition [U we obtain the following. 

Corollary 1. Let C be a base class consisting of the union of some of the classes MINMAX, MAJ, 
MIN, andMAL. Then Strong C-Backdoor Detection is fixed-parameter tractable for the combined 
parameter size of the backdoor set, size of the domain, and the maximum arity of the given CSP instance. 
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We are now ready to provide analogous results for weak backdoor sets. The proof is very similar 
to the case of strong backdoor sets, however, it becomes slightly simpler, because we do not need to 
consider different operations for different assignments (since there is only one assignment, required for a 
weak backdoor set). We can hence branch on the possible operations before we start the depth-bounded 
search tree procedure, which also results in a slight improvement in the running time of the algorithm. 

Theorem 11. Let W be a tractable polymorphism property. Then Weak Cp-Backdoor Detection 
is fixed-parameter tractable for the combined parameter size of the backdoor set, size of the domain, and 
the maximum arity of the given CSP instance. 

Proof. Let P be a tractable property, and let (I, k) with I = {V, D, C) be an instance of Weak Cp-Back- 
DOOR Detection. Let P be the set of all operations on D that have property P. Then, P can be 
constructed in fpt-time with respect to the size of the domain , because there are at most ’’ cp-ary 

operations on D and for each of them we can test in polynomial time, whether it satisfies 

property P. The algorithm uses a depth-bounded search tree approach to find a weak {^}-backdoor set 
of size at most k for every (f> € P. 

The algorithm works by constructing a search tree for every operation <j) £ P. In T,f, every node 
is labeled by a set B of at most k variables of V. Additionally, every leaf node has a second label, which 
is either Yes or No. T^j, is defined inductively as follows. The root of T,p is labeled by the empty set. 
Furthermore, if t is a node of T^, whose hrst label is B, then the children of t in T,p are obtained as 
follows. If there is an assignment t : B ^ D such that I[t] is closed under <j) and I[t] has a solution , 
then B is a weak P-backdoor set of size at most k, and hence t becomes a leaf node, whose second label 
is Yes. Otherwise, i.e., for every assignment t : B ^ D either I[t] is not closed under or I[t\ has no 
solution, we consider two cases: (1) \B\ = k, then t becomes a leaf node, whose second label is No, and 
(2) \B\ < k, then for every assignment t : B ^ D t has the following children: for every every variable 
V in the scope of some constraint c S I[t\ that is not closed under t has a child whose hrst label is 
B U {u}. 

If there is a, (f> € P such T^j, has a leaf node, whose second label is Yes, then the algorithm returns 
the hrst label of that leaf node. Otherwise the algorithm returns No. This completes the description of 
the algorithm. 

We now show the correctness of the algorithm. First, suppose there is a (j> € P such that the search 
tree built by the algorithm has a leaf node t whose second label is Yes. Here, the algorithm returns 
the hrst label, say B of t. By the construction of T,j,, we obtain that \B\ < k and there is an assignment 
T : B ^ D such that I[t] is closed (j) and I[t] has a solution, as required. 

Now consider the case where the algorithm returns No. We need to show that there is no weak 
P-backdoor set of size at most k for I. Assume, for the sake of contradiction that such a set B exists. 
Because P is a weak P-backdoor set for I there must exist an operation G P and an assignment 
T : B ^ D such that I[t] is closed under and /[r] has a solution. 

Observe that if has a leaf node t whose hrst label is a set B' with B' C B, then the second label 
of t must be Yes. This is because, either \B'\ < A: in which case the second label of t must be Yes, or 
|P'| = A: in which case B' = B and by the dehnition of B it follows that the second label of t must be 
Yes. 

It hence remains to show that Ttj, has a leaf node whose hrst label is a set B' with B' Q B. This will 
complete the proof about the correctness of the algorithm. We will show a slightly stronger statement, 
namely, that for every natural number A, either has a leaf whose hrst label is contained in B or 
has an inner node of distance exactly I from the root whose hrst label is contained in B. We show the 
latter by induction on 1. 

The claim obviously holds for A = 0. So assume that contains a node t at distance £ from the 
root of whose hrst label, say B', is a subset of B. If t is a leaf node of P^, then the claim is shown. 
Otherwise, for every assignment t : B' ^ D either I[t\ is not closed under (j) or I[t] has no solution. 
Let r' : P' —>■ P be the restriction of r to B'. Because I[t] and hence also I[t'] has a solution, we obtain 
that I[t'] is not closed under the operation fi. By the dehnition of the search tree P^, it holds that t 
has a child t' whose hrst label is P' U {u}, for every variable v in the scope of some constraint c G /[t'J 
that is not closed under fi. We claim that Y(c) fl P 0 and hence t has a child, whose hrst label is a 
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subset of B, as required. Indeed, suppose not. Then c € I[t\ a contradiction to our assumption that 
I[t] is closed under (j). This concludes our proof concerning the correctness of the algorithm. 

The running time of the algorithm is obtained as follows. Let T^f, be a search tree obtained by 
the algorithm. Then the running time of the depth-bounded search tree algorithm is 0{\V{T(i,)\) 
times the maximum time that is spent on any node of T^. Since the number of children of any 
node of is bounded by \D\’‘6{I) (recall that S{I) denotes the maximum arity of any constraint 
of I) and the longest path from the root of to some leaf of is bounded by fc -I- 1, we obtain 
that |y(T 0 )| < 0{{\Df6{I)) ). Furthermore, the time required for any node t of T is at most 

0(|ZI|^comp_rest(/, T)|C'(J[T])|check_poly(c, (())check^ol(/[r], (())), where comp_rest(/, r) is the time re¬ 
quired to compute /[r] for some assignment r of at most k variables, check_poly(c, (f>) is the time required 
to check whether a constraint c of /[r] preserves the polymorphism (j), and check_sol(/[T], (/>) is the time 
required to solve I[t] given that it is closed under the operation (j). Observe that comp_rest(/, r) and 
|C'(/[t])| are polynomial in the input size. The same holds for check_poly(c, </)) and check^ol(/[r], 0), 
because ((> is a cp-ary tractable polymorphism. Now, the total running time required by the algorithm 
is the time required to compute the set P plus the time required to compute for every (p G P. 
Putting everything together, we obtain 0{\P\{\D\^S{I))'^^^\D\^n^^^^) = = 

as the total running time of the algorithm, where n denotes the in¬ 
put size of the CSP instance. This shows that Weak Cp-Backdoor Detection is fixed-parameter 
tractable parameterized by k, 6{I), and \D\. □ 

Because of Proposition [TJ we obtain the following. 

Corollary 2. Let C be a base class consisting of the union of some of the classes MINMAX, MAJ, 
MIN, and MAL. Then Weak C-Backdoor Detection is fixed-parameter tractable for the combined 
parameter size of the backdoor set, size of the domain, and the maximum arity of the given CSP instance. 


6 Hardness of Backdoor Detection for CSP 

In this section we show our parameterized hardness results for Strong Cp-Backdoor Detection and 
Weak Cp-Backdoor Detection. In particular, we show that Strong Cp-Backdoor Detection 
and Weak Cp-Backdoor Detection are W[2]-hard parameterized by the size of the backdoor set 
even for CSP instances of Boolean domain and for CSP instances with arity two. We start by showing 
hardness for CSP instances with Boolean domain. 

Theorem 12. Let ¥ be a tractable polymorphism property such that all operations (p with P(((>) are 
idempotent. Then, Strong Cp-Backdoor Detection and Weak Cp-Backdoor Detection are 
W[2]-hard parameterized by the size of the backdoor set, even for CSP instances over the Boolean domain. 

Proof. We start by introducing what we call “Boolean barriers” of operations since they form the basis 
of the proof. Let p : D" —> D be an n-ary operation over D. We say a set A of r(A)-ary tuples over 
{0,1} is a Boolean barrier for p if there is a sequence (ti,... ,tn) of (not necessarily distinct) tuples 
in A such that p{ti,... ,tn) p. A. We call a Boolean barrier X oi p minimal if |A| is minimal over all 
Boolean barriers of p. For an operation p, we denote by X{p) a minimal Boolean barrier of p. For our 
reduction below, we will employ the fact that every operation p with P((()) has a non-empty Boolean 
barrier of finite size. The reason that a Boolean barrier must exist is simply because p is tractable and 
if p would not have a Boolean barrier then every Boolean CSP instance would be closed under p and 
thus tractable, which unless P = NP is not possible. To see that X{p) is finite first note that |A((/))| is 
at most as large as the arity cp of p. Moreover, r(A) < 2°*’ because there are at most 2'^' distinct cp-ary 
tuples over {0,1}. Hence the size of X{p) is at most cp • 2^*. 

We show the theorem via an fpt-reduction from Hitting Set. Given an instance {Q,U,k) of 
Hitting Set, we construct a CSP instance I = (H,{0,11,(7) such that Q has a hitting set of size 
at most k if and only if I has a strong Cp-backdoor set of size at most k. Let A be the set of all 
Boolean barriers of polymorphisms with property P, i.e., A = { A((()) : p has property P}. Note that 
the cardinality of A only depends on cp and can thus be considered to be finite. The variables of I are 


19 


{xu : u £ [/ } U { oi(A, Q),..., Oj.(;y)(A, Q) : X £ A and Q £ Q}. Furthermore, for every Boolean barrier 
A € A with A = {ti, ..., tn} for some natural number n, and Q £ Q with Q = {ui, ..., n|Q|}, C contains 
a constraint R{X,Q) with scope (oi(A, Q),..., Or(A) (A, Q), ,..., a:„|Q|) whose relation contains the 

row 

■ • ■) (* mod2 ,...,z mod 2) 

'-V-' 

|Q|times 

for every i in 1 < i < n. This completes the construction of I. Observe that / is satisfiable, because 
the first rows of every constraint of I are pairwise compatible. In particular, set all the variables of 
the form {xu '■ u £ U} to zero, and set all remaining variables according to their first-row constraints 
— note that these rows are the same for different Q ^ Q' associated with the same Boolean barrier 
A, while on the other hand, for different Boolean barriers A ^ A', the associated variables are different 
and there is no cause for conflict. Suppose that Q has a hitting set B of size at most k. We claim that 
Bu = { Xu ■ u £ B } is a. strong Cp-backdoor set of I. Let t : ^ U be an assignment of the variables 

in B and A G A be a Boolean barrier of maximum cardinality over all Boolean barriers in A. We claim 
that I[t\ is closed under any operation (p with property P with A = X{phi) and hence Bu is a strong 
Cp-backdoor set of I. Because B is a hitting set of Q, it follows that every relation of /[r] contains at 
most half of the tuples of the corresponding relation in I. Furthermore, because p is idempotent (and 
hence every tuple is mapped to itself), it holds that \lambda'\ > 1 for every A' £ A. It follows that 
every relation of I[t\ contains at least one tuple less than the corresponding relation in I. Because of 
the choice of A, we obtain that every constraint c £ C[r] contains less than |A| tuples, and hence I[t] 
is closed under any operation p with property P and A = X{p). Hence, Bu = { Xu ■ u £ B } is a strong 
Cp-backdoor set of I, as required. Towards showing that Bu is also a weak Cp-backdoor set of / consider 
the assignment t : t {0}. It follows from the above argumentation that C[t] is closed under any 

operation p with property P and A = X{p). Furthermore, I[t] is satishable, because the first row of every 
constraint in C is also contained in every constraint of I[t] and these rows are pairwise compatible. 

For the reverse direction, suppose that / has a strong Cp-backdoor set B of size at most k. Because 
for every Q £ Q it holds that the set of constraints { i?(A, Q) : A G A } is not closed under any operation 
p with property P, we obtain that B has to contain at least one variable from IJaeA Q)) 

every Q £ Q. Since the only variables that are shared between ^ Q)) UagA ^ Q')) 

for distinct Q, Q' G Q are the variables in { a;„ : u G 17 }, it follows that Q has a hitting set of size at 
most \B\ < fc, as required. □ 

Because all min/max, majority, minority and Mal’cev operations can be defined via tractable prop¬ 
erties and are idempotent, we obtain. 

Corollary 3. For every C £ {MINMAX, MAJ, MIN, MAL}, Strong C-Backdoor Detection and 
Weak C-Backdoor Detection are W[2]-hard parameterized by the size of the backdoor set, even for 
CSP instances over the Boolean domain. 

In the following we show that hardness also holds if we drop the restriction on the domain of the 
CSP instance but instead consider only CSP instances of arity 2. To do so we need the following lemma. 

Lemma 1. For every C £ {MINMAX, MAJ, MIN, MAL} and every k > 3, there is a 2-ary CSP instance 
I{C, k) with k constraints, each containing the all 0 tuple, such that I(C, k) ^ C but for every assignment 
T of at least one variable of I{C,k), it holds that I(C,k)[T] G C. 

Proof. The proof of this lemma is inspired by the proof of 0, Lemma 3]. For every C as in the statement 
of the lemma, I(C,k) has variables vi,...,V 2 k and contains a constraint Ci with scope {v 2 i-i,V 2 i) for 
every I < i < k. 

If C = MAJ, the domain of I{C,k) is {1,...,3A: — 4}, and for every 1 < i < k the relations 
R{ci) are defined as follows: i?(ci) = {(0, 0), (1, 3), (1,4), (2, 5)}, i?(c 2 ) = {(0,0), (1, 3), (2,4), (1, 5)}, 
i?(c 3 ) = 1(0,0), (2, 3(fc - 2)),(l,3(fc - 2) + l),(2,3(fc - 2) + 2)}, and for every 3 < i < k, R{ci) = 
{(0, 0), (3(f - 3), 3(i - 2)), (3(i - 3)-h 1, 3(i - 2) + 1), (3(i - 3) + 2,3(i - 2) + 2)}. Clearly, the instance 
is binary and every relation contains the all 0 tuple. We first show that I{C,k) is not closed under 
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any majority operation cj). From R{ci) and i?(c2), we obtain (/)(3,4, 5) = 3 and from Rics), we obtain 
— 2), 3(fc — 2) +1,3(fc — 2) + 2) € {3(fc — 2) +1,3(fc — 2) + 2}. Finally, from R{ci) with 3 < i < k,we 
obtain (^(3(z — 2), 3(i — 2) + 1,3(i — 2) + 2) = j if and only if (^(3(i — 1), 3(i — 1) + 1, 3(i — 1) + 2) = j + 3. 
This chain implies /(3,4, 5) G {4,5} in contradiction to /(3,4, 5) = 3. On the other hand, for any 
assignment r that sets at least 1 variable of /(C, fc), it holds that at least one relation of I{C,k)[T] 
contains at most 2 tuples. This relation is then closed under any majority operation and breaks the 
above chain of implications. 

If C = MINMAX, the domain of I{C, k) is {1,..., /c + 1}, and for every 1 < i < k the relations 
R{ci) are defined as follows: Ri = {(0,0), {i,i + 1), {i + 1,1), (i + 1,1 + 1)} for every 1 < i < k and 
Rk = {(0, 0), (1, k + 1), (fc + 1,1), (1,1)}. Clearly, the instance is binary and every relation contains the 
all 0 tuple. Let be a max operation (the proof remains the same if 0 is a min operation). Then, for 
every 1 < i < A:, the relation Ri implies i < i + 1, but the relation Rk implies 1 > A: + 1, a contradiction. 
Hence, the set of all relations i?i,..., Rk is not closed under any max operation. On the other hand, for 
any assignment r that sets at least 1 variable of I(C, k), it holds that at least one relation of I{C, A:)[t] 
contains at most 2 tuples that agree on (at least) one coordinate. This relation is then closed under any 
max operation and breaks the above chain of implications. 

If C = MIN, the domain of I{C, k) is {1,..., 3A: + 2}, and for every 1 < A < A: the relations R{ci) are 
defined as follows: Ri = {(0,0), (1,3), (1,4), (2,5)}, Ri = {(0,0), (3(A -1), 31), (3(* -1) +1,3i +1), (3(1- 
1) + 2, 3* + 2)1 for every 1 < z < A:, and Rk = {(0, 0), (1, 3(A: — 1)), (2, 3(A: — 1) + 1), (1, 3(A: — 1) + 1)}. 
Clearly, the instance is binary and every relation contains the all 0 tuple. Let ^ be a minority operation. 
Then, Ri implies /(3,4, 5) = 5, which together with the relations Ri, for every I < z < A: implies 
/(3z, 3z + 1, 3z + 2) = 3z + 2. In particular, we obtain /(3(A; — I),3(A:—I) + 1,3(A;—l) + 2) = 3(A: — I) + 2, 
which contradicts relation Rk- Hence, the set of all relations Ri,, Rk is not closed under any minority 
operation. On the other hand, for any assignment r that sets at least I variable of I{C, k), it holds that 
at least one relation of /(C,A;)[r] contains at most 2 tuples. This relation is then closed under any 
minority operation and breaks the above chain of implications. If C = MAL, we can use the same set of 
relations. □ 

Theorem 13. For every C G {MINMAX, MAJ, MIN, MAL}, Strong C-Backdoor Detection and 
Weak C-Backdoor Detection are W[2]-hard parameterized by the size of the backdoor set even for 
CSP instances with arity 2. 

Proof. The proof of the theorem is inspired by the proof of [3, Theorem 6]. We show the theorem via a 
fpt-reduction from Hitting Set. Let PI = {U,iF,k) be an instance of Hitting Set. 

We construct a CSP instance I = {V,D,C) of arity 2 such that FI has a hitting set of size at most 
k if and only if I has a strong C-backdoor set of size at most k. The variables of I are {xu : u G 
hi} U {yp : F G T and I < z < |F| }. Furthermore, for every F G J- with F = {m, ...,zz|/7’|}, let I{F) 
be the CSP instance obtained from I{C,\F\) (see Lemma [T|), where for every z with 1 < z < |F|, we 
replace the scope of the constraint Ci with {xui, yff) and adapt I{C, |F|) in such a way that for distinct 
F,F' G F, I{C,\F\) and I{C,\F'\) have no common domain value. Then, the constraints of I is the 
union of the constraints of /(F) over every F G F. This completes the construction of I. Clearly, / has 
arity 2 and is satisfiable, e.g., by the all 0 assignment. 

Suppose that H has a hitting set B of size at most k. We claim that = { Xu : u G B } is a strong 
C-backdoor set of I. We start by showing that is a strong C-backdoor set of I. Let r :/?„—>• D be an 
assignment of the variables in B. Then, it follows from Lemma[T]that the constraints of /(F) [r] are in C. 
In particular, there is an operation (j) (which is either min, max, majority, minority or Mal’cev, depending 
on C) such that the constraints of /(F) [r] are closed under cf. Because for every distinct F, F' G F, 
/(F) [r] and /(F')[t] do not have a common domain value, it follows that there is an operation (which is 
a combination of the operations for each /(F)[t])) such that /[r] is closed under (f and (f> is a min/max, 
majority, minority, or Mal’cev operation (depending on C). Hence, I[t] G C, as required. Towards 
showing that is also a weak C-backdoor set of C consider the assignment r : Bu {0}. It follows 
from the above argumentation that /[r] G C. Furthermore, /[r] is satisfiable, because every constraint 
of /[r] still contains the all 0 tuple. 
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For the reverse direction, suppose that I has a strong C-backdoor set B of size at most k. Because 
for every F S it holds that the set of constraints of I{F) is not in C, we obtain that B has to contain 
at least 1 variable from V{I{F)) for every F G F. Since the only variables that are shared between 
V{I{F)) and V{I{F')) for distinct F,F' G F are the variables in { a;„ : m G W }, it follows that F[ has a 
hitting set of size at most \B\ < k, as required. □ 


7 Comparison to Partition-Backdoors 


In this section we draw a comparis on between our notion of backdoor sets, and the approach recently 
introduced bv iBessiere et ah ( 2013 1. which we will call the partition backdoor set approach. The main 
difference is that the partition backdoor set approach considers a backdoor with respect to a subset of 
constraints th a t is tr actable (because it is closed under some tractable polymorphism). In particular, 


Bessiere et al.l (120131 1 define partition backdoor sets as follows. 


Definition 1. Let ¥ be a tractable polymorphism property. A ¥-partition backdoor of a CSP instance 
I = {V, D, C) is a set B of variables, such that there is a partition of C into Ci and C 2 for which the 
following holds: 


• {V,D,C 2 ) GCv , and 


• If¥is characterized using only idempotent polymorphisms, then B is the set of all variables in the 
scope of the constraints of C 2 , while if¥ is characterized using only conservative polymorphisms, 
then the variables of B are a vertex cover of the primal graph of Ci. 


Note that the primal graph of a set C of constraints is the graph, whose vertices are all the variables 
appearing in the scope of the constraints in C and that has an edge between two variables if and only 
if t hey appear together in the scope of at least one constraint in C. 


Bessiere et al.l (120131 1 show that given a CSP instance and a partition (Ci, (72) as defined above, then 


the detection and evaluation of P-partition backdoor sets are fixed-parameter tractable 


• in the size of the domain and the number of variables in Ci, if P is characterized using only 
idempotent polymorphisms; 

• in the size of the domain and the vertex cover of the primal graph oi Ci, if P is characterized using 
only conservative polymorphisms. 

Observe that the above tractability results only hold when the partition {Ci,C 2 ) is given as a part of 
the input, if it is not then the detection of partition backdoors is fixed-parameter intractable (for most 
combinations of parameters). 

Apart from the complexity of detecting and evaluating backdoors it is also important that the 
backdoor sets are as small as possible over a wide variety of CSP instances. In the following we will 
observe that our backdoor sets are always at most as large as partition backdoors and we exhibit CSP 
instances where our backdoors have size one but the size of a smallest partition backdoor set can be 
arbitrary large. 

Proposition 2. Let ¥ be a tractable polymorphism property. Then, for every CSP instance I the size 
of a smallest strong ¥-backdoor set of I is at most the size of a smallest ¥-partition backdoor set of F 

Proof. This follows from the observation that any P-partition backdoor set is also a strong P-backdoor 
set. □ 


Proposition 3. For any tractable property P and any natural number n , there is a CSP instance that 
has a strong/weak Cp-hackdoor set of size one, but the size of a smallest ¥-partition backdoor set is Ll(ji). 

Proof. Let P be a tractable property and let I = {V, D, C) be a CSP instance such that: 

• no constraint of / is closed under a polymorphism with property P; 
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• all constraints of I share a common variable, say x, such that for any assignment r of x, the CSP 
instance I[t] is closed under some polymorphisms with property P. 

It is straightforward to check that for any number of variables n, such a CSP instance using n variables 
can be easily constructed for every tractable property P that contains at least one operation. Further¬ 
more, the variable x is a strong Cp-backdoor set of I. However, because no constraint of I is closed 
under any operation from P it follows that every partition backdoor has to contain at least all but 1 
variable from every constraint of I. □ 


8 Summary 


We have introduced heterogeneous base classes and have shown that strong backdoor sets into such 
classes can be considerably smaller than strong backdoor sets into homogeneous base classes; never¬ 
theless, the detection of strong backdoor sets into heterogeneous base classes is still fixed-parameter 
tractable in many natural cases. Hence, our results push the tractability boundary considerably further. 
Our theoretical evaluation entails hardness results that establish the limits for tractability. 

Our main result in the context of SAT (Theorem |6l) provides a complete complexity classification 
for strong backdoor set detection to heterogeneous base classes composed of the well-known Schaefer 
classes. We observe that the hardness results rely on clauses of unbounded length and show that if the 
maximum length of the clauses is taken as an additional parameter then strong as well as weak backdoor 
set detection into any combination of the Schaefer classes is fixed-parameter tractable (Theorem [7]). 

Our main result for CSP backdoors ((Theorem [T0)l establishes fixed-parameter tractability for a wide 
range of heterogeneous base classes composed of possibly infinitely many tractable polymorphisms. In 
particular, we show that the detection of strong as well as weak backdoor sets is fixed-parameter tractable 
for the combined parameter backdoor size, domain size, and the maximum arity of constraints. In fact, 
this result entails heterogeneous base classes, as different instantiations of the backdoor variables can lead 
to reduced instances that are closed under different polymorphisms (even polymorphisms of different 
type). We complement our main result with hardness results that show that we lose fixed-parameter 
tractability when we omit either domain size or the maximum arity of constraints from the parameter 
(Theorems [T^ and [T31) . 

It would be interesting to extend our line of research to the study of backdoor sets into heterogeneous 
base classes composed of homogeneous classes defined by e.g., structural restrictions such as bounded 
treewidth 20|, ISTf or bounded hyper-treewidth 2l| , and base classes defined by “hybrid” concepts like 
forbidden patterns Q, in contrast to the Schaefer classes for SAT, and polymorphism-based classes for 
CSP. Another promising direction of future research is to parameterize backdoor sets not by their size 
but by structural properties of the backdoor set and how it interacts with the rest of the instance, 
similar to the parameters that have been considered for modulators into graph classes [ISlil ; some 
first results in this direction have recently been obtained 16|. 
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